Attribute Weighting via Differential Evolution Algorithm for Attribute Weighted Naive Bayes (WNB)
نویسندگان
چکیده
The naive Bayes (NB) is a popular classification technique for data mining and machine learning, which is based on the attribute independence assumption. Researchers have proposed out many effective methods to improve the performance of NB by lowering its primary weakness---the assumption that attributes are independent given the class, such as backwards sequential elimination method, lazy elimination method and so on. Recently, Mark Hall presents a simple filter method for setting attribute weights for naive Bayes and proposes a decision tree-based attribute weighted method. In his paper, the experimental results show that the new weighted method performs better than other weighted methods. That weighting idea is taken as the objective of our study in which we use differential evolution algorithms to determine the weights of attributes and then use these weights in our previously developed Weighted Naïve Bayes (WNB). We evaluate the performance of new configuration (DE-WNB) on the whole 36 standard UCI data sets in Weka system. We also compare it with the decision tree-based attribute weighted methods and other methods mentioned in Mark Hall’ paper for those data sets. Our experimental results show that the classification accuracy of our new algorithm DE-WNB is much higher than those of the other algorithms used to compare. The obtained classification accuracy is very good with respect to other common WNB classifiers in literature.
منابع مشابه
Alleviating naive Bayes attribute independence assumption by attribute weighting
Despite the simplicity of the Naive Bayes classifier, it has continued to perform well against more sophisticated newcomers and has remained, therefore, of great interest to the machine learning community. Of numerous approaches to refining the naive Bayes classifier, attribute weighting has received less attention than it warrants. Most approaches, perhaps influenced by attribute weighting in ...
متن کاملAttribute Weighting via Differential Evolution for Attribute Weighted Clonal Selection Algorithm
Classification is an important technology in data mining, while clonal selection algorithm (CSA) is a very effective classification method. Although CSA brings a new effective tool for solving complex problems, we can not completely say that it over-performs to other algorithms especially in the classification field. A main problem of CSA classifier is that it does not carry attribute imbalance...
متن کاملAttribute Value Weighted Average of One-Dependence Estimators
Of numerous proposals to improve the accuracy of naive Bayes by weakening its attribute independence assumption, semi-naive Bayesian classifiers which utilize one-dependence estimators (ODEs) have been shown to be able to approximate the ground-truth attribute dependencies; meanwhile, the probability estimation in ODEs is effective, thus leading to excellent performance. In previous studies, OD...
متن کاملAnalysis of sequential physiology data with weighted naive Bayes
In this project, I describe how I address the ICML 2004 Physiological Data Modeling Contest. For the gender prediction task, I compressed the large entry-based dataset to small session-based dataset and manually devised 90 features using a histogram method. Weighted naive Bayes (WNB) which is an extension of naive Bayes was applied and Markov Chain Monte Carlo was combined to solve the weight u...
متن کاملNaive Bayes Spam Filtering Using Word Position Attributes
This paper explores the use of the naive Bayes classifier as the basis for personalized spam filters. Various machine learning algorithms, including variants of naive Bayes, have previously been used for this purpose, but the author’s implementation using word position based attribute vectors gives very good results when tested on several publicly available corpora. The effect of various forms ...
متن کامل